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Abstract 

Results are presented from searches for the standard model Higgs boson in proton- 
proton collisions at = 7 and 8TeV in the CMS experiment at the LHC, using 
data samples corresponding to integrated luminosities of up to 5.1 fb^^ at 7TeV and 
5.3 fb^^ at 8TeV. The search is performed in five decay modes: 77, ZZ, WW, t+t^, 
and bb. An excess of events is observed above the expected background, a local signif- 
icance of 5.0 standard deviations, at a mass near 125 GeV, signalling the production 
of a new particle. The expected significance for a standard model Higgs boson of 
that mass is 5.8 standard deviations. The excess is most significant in the two decay 
modes with the best mass resolution, 77 and ZZ; a fit to these signals gives a mass of 
125.3 ± 0.4 (stat.) ± 0.5 (syst.) GeV. The decay to two photons indicates that the new 
particle is a boson with spin different from one. 
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1 Introduction 

The standard model (SM) of elementary particles has provided a remarkably accurate descrip- 
tion of results from many accelerator and non-accelerator based experiments. The SM com- 
prises quarks and leptons as the building blocks of matter, and describes their interactions 
through the exchange of force carriers: the photon for electromagnetic interactions, the W and 
Z gauge bosons for weak interactions, and the gluons for strong interactions. The electromag- 
netic and weak interactions are unified in the electroweak theory. Although the predictions 
of the SM have been extensively confirmed, the question of how the W and Z gauge bosons 
acquire mass whilst the photon remains massless is still open. 

Nearly fifty years ago it was proposed [1 -6J that spontaneous symmetry breaking in gauge 
theories could be achieved through the introduction of a complex scalar doublet field. Ap- 
plying this proposal to the electroweak theory 13^91 leads to the generation of the W and Z 
masses, and to the prediction of the existence of the SM Higgs boson (H). The scalar field also 
gives mass to the fundamental fermions through the Yukawa interaction. The mass Wh of the 
SM Higgs boson is not predicted by theory. However, general considerations IITOtiTSlI suggest 
that mn should be smaller than ~1 TeV, while precision electroweak measurements imply that 
wjh < 152 GeV at 95% confidence level (CL) llT4t . Over the past twenty years, direct searches 
for the Higgs boson have been carried out at the LEP collider, leading to a lower bound of 
niu > 114.4 GeV at 95% CL ||15|, and at the Tevatron proton-antiproton collider, excluding the 
mass range 162-166 GeV at 95% CL Ill6l . 

The discovery or exclusion of the SM Higgs boson is one of the primary scientific goals of 
the Large Hadron Collider (LHC). Previous direct searches at the LHC were based on data 
from proton-proton collisions corresponding to an integrated luminosity of 5fb^^ collected at 
a centre-of-mass energy ^/s = 7 TeV. The CMS experiment excluded at 95% CL a range of 
masses from 127 to 600 GeV IITtI . The ATLAS experiment excluded at 95% CL the ranges 111.4- 
116.6, 119.4-122.1 and 129.2-541 GeV ||18| . Within the remaining allowed mass region, an excess 
of events near 125 GeV was reported by both experiments. In 2012 the proton-proton centre-of- 
mass energy was increased to 8 TeV and by the end of June an additional integrated luminosity 
of more than 5 fb^^ had been recorded by each of these experiments, thereby enhancing signif- 
icantly the sensitivity of the search for the Higgs boson. 

This paper reports the results of a search for the SM Higgs boson using samples collected by 
the CMS experiment, comprising data recorded at i/s = 7 and 8 TeV. The search is performed 
in five decay modes, H — > 77, ZZ, WW, t+t^, and bb, in the low-mass range from 110 up 
to 160 GeV. In this mass range the Higgs boson production cross section is predicted to range 
from 21 (27) to 12 (15) pb at = 7 (8) TeV The natural width of the SM Higgs boson in 
this mass range is a few MeV and the width of any observed peak would be entirely dominated 
by instrumental mass resolution. In what follows, i stands for electrons or muons, H — )■ t+t^ 
is denoted as H — > tt, H — > bb as H — > bb, etc. For the final states ZZ and WW in the low-mass 
region, one or more of the Z or W bosons is off mass shell. 

With respect to the published analyses ||20V|24| , most analyses have been re-optimized, incor- 
porating improvements in reconstruction performance and event selection, and mitigating the 
more challenging conditions due to the higher LHC intensities in 2012. The new analyses pre- 
sented herein, of 8 TeV samples, and of 7 TeV samples featuring modified event selection crite- 
ria, were performed in a "blind" way: the algorithms and selection procedures were formally 
approved and fixed before the results from data in the signal region were examined. In the 
previously published analyses similar but less formal procedures were followed. 
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2 The CMS experiment 



Within the context of this search for the SM Higgs boson, we report the observation of an excess 
of events above the expected background, consistent with the production of a new particle with 
mass near 125 GeV. The observed local significance is 5.0 standard deviations (cr), compared 
with an expected significance of 5.8 cr. The evidence is strongest in the two final states with the 
best mass resolution, namely H — > 77 with a significance of 4.1 ct" and H — > ZZ (with the Z 
bosons decaying to electrons or muons) with a significance of 3.2 cr. The decay to two photons 
indicates that the new particle is a boson with spin different from one. 

2 The CMS experiment 

The possibility of detection of the SM Higgs boson played a crucial role in the conceptual design 
of the CMS experiment as a benchmark to test the performance of the detector ["251-127). Since the 
SM Higgs boson mass is not predicted by theory and its production cross section and natural 
width vary widely over the allowed mass range, a search was envisaged over a large range 
of masses and in diverse decay modes: pairs of photons, Z bosons, W bosons, t leptons, and 
b quarks. Planning in view of the analysis of all these channels ensured a detector capable of 
observing a Higgs boson over a broad mass range and able to detect most potential signals of 
new physics. 

The central feature of the CMS apparatus [281 is a superconducting solenoid of 6 m inter- 
nal diameter, which provides a magnetic field of 3.8 T. Within the field volume are a sili- 
con pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and 
a brass /scintillator hadron calorimeter (HCAL). Muons are measured in gas-ionization detec- 
tors embedded in the steel flux-return yoke. Extensive forward calorimeters complement the 
coverage provided by the barrel and endcap detectors. 

Charged particles are tracked within the pseudorapidity range |//| < 2.5, where 
rj = — ln[tan(0/2)], and 6 is the polar angle measured from the positive z axis (along the 
anticlockwise beam direction). The silicon pixel tracker comprises 66 million 100 x 150 //m^ 
pixels, arranged in three barrel layers and two endcap disks at each end. The silicon strip 
tracker, organized in ten barrel layers and twelve endcap disks at each end, comprises 9.3 mil- 
lion strips with pitch between 80 and 180 fim, with a total silicon surface area of 198 m^. The 
tracker has a track-finding efficiency larger than 99% for muons with transverse momentum, 
Pj, greater than 1 GeV, a transverse momentum resolution between 2 and 3% for charged tracks 
of Pj ~ 100 GeV in the central region (|// 1 < 1.5), and a very good b-jet-identification capability. 
Measurements of the impact parameters of charged tracks and secondary vertices are used to 
identify jets that are likely to contain the hadronisation and decay products of b quarks ("b 
jets"). A b-jet tagging efficiency of more than 50% is achieved with a rejection factor for light- 
quark jets of ~200, as measured in tt events in data [29]. The dimuon mass resolution at the 
Y mass, dominated by instrumental effects, is measured to be 0.6% in the barrel region [30J, 
consistent with the design goal. 

The ECAL is a fine-grained hermetic calorimeter consisting of 75 848 lead tungstate crystals, 
arranged in a quasi-projective geometry and distributed in a barrel region (|?/| < 1.48) and two 
endcaps that extend up to \r]\ = 3.0. The front-face cross section of the crystals is 22 x 22 mm^ 
in the barrel region and 28.6 x 28.6 mm^ in the endcaps. Preshower detectors consisting of two 
planes of silicon sensors interleaved with a total of three radiation lengths of lead absorber are 
located in front of the endcaps. Electromagnetic showers are very narrow in lead tungstate 
(Moliere radius of 21 mm), helping in particle identification and in the implementation of iso- 
lation criteria. In the central barrel region the energy resolution of electrons that do not radiate 
substantially in the tracker material indicates that the resolution of unconverted photons is 
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consistent with design goals. For such photons the diphoton mass resolution is estimated to be 
1.1 GeV at a mass of 125 GeV. 

The HCAL barrel and endcaps are sampling calorimeters consisting of brass and scintillator 
plates, covering \r]\ < 3.0. Their thickness varies from 7 to 11 interaction lengths, depending 
on ?/; a scintillator "tail catcher" placed outside the coil of the solenoid, just in front of the in- 
nermost muon detector, extends the instrumented thickness to more than 10 interaction lengths 
everywhere. Iron forward calorimeters with quartz fibers, read out by photomultipliers, extend 
the calorimeter coverage up to \ f]\ = 5.0. 

Muons are measured in the range < 2.4, with detection planes based on three technologies: 
drift tubes (|f/| < 1.2), cathode strip chambers (0.9 < |?^| < 2.4), and resistive plate chambers 
(I //I < 1.6). The first two technologies provide a precise position measurement and trigger 
whilst the third one provides precise timing information as well as a second and independent 
trigger. The muon system consists of four stations in the barrel and endcaps, designed to ensure 
robust triggering and detection of muons over a large angular range. In the barrel region each 
muon station consists of twelve drift-tube layers, except for the outermost station, which has 
eight layers. In the endcaps, each muon station consists of six detection planes. The precision 
of the r-(p measurement is 100 jim in the drift tubes and varies from 60 to 140 Jim in the cathode 
strip chambers. 

The CMS trigger and data acquisition systems ensure that potentially interesting events are 
recorded with high efficiency. The first level (LI) trigger, comprising the calorimeter, muon, 
and global trigger processors, uses coarse-granularity information to select the most interesting 
events in less than 4 fis. The detector data are pipelined to ensure negligible deadtime up to 
a LI rate of 100 kHz. After LI triggering, data are transferred from the readout electronics of 
all subdetectors, through the readout network, to the high-level-trigger processor farm, which 
operates offline-quality reconstruction algorithms to decrease the event rate to around 0.5 kHz, 
before data storage. 

The CMS experiment employs a highly distributed computing infrastructure, with a primary 
Tier-0 centre at CERN, supplemented by seven Tier-1, more than 50 Tier-2, and many Tier- 
3 centres at national laboratories and universities throughout the world. The CMS software 
running on this high-performance computing system executes numerous tasks, including the 
reconstruction and analysis of the collected data, as well as the generation and detailed detector 
simulation of Monte Carlo (MC) event samples. 

3 Event reconstruction 

The CMS "particle-flow" event description algorithm II3T1I52I is used to reconstruct and identify 
each single particle with an optimized combination of all subdetector information. In this pro- 
cess, the identification of the particle (photon, electron, muon, charged hadron, neutral hadron), 
plays an important role in the determination of the particle momentum. The reconstructed par- 
ticles are henceforth referred to as objects. 

Jets are reconstructed by clustering the particle-flow objects with the anti-kj algorithm ||33| 
using a distance parameter of 0.5. Additional selection criteria are applied to each event to 
remove spurious features originating from isolated noise patterns in certain HCAL regions, 
and from anomalous signals caused by particles depositing energy in the silicon avalanche 
photodiodes used in the ECAL barrel region. The average number of pp interactions per LHC 
bunch crossing is estimated to be about 9 and 19 in the 7TeV (2011) and 8TeV (2012) data sets. 
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3 Event reconstruction 



respectively. Energy from overlapping pp interactions ("pileup"), and from the underlying 
event, is subtracted using the FastJet technique I|34ti36l , which is based on the calculation of 
the ^/-dependent transverse momentum density, evaluated on an event-by-event basis. 

The jet momentum is determined as the vector sum of all particle momenta in the jet. Jet en- 
ergy corrections are derived from simulation studies and from in situ measurements using the 
energy balance of dijet and Z/ 7+jet events 113 71 . These corrections are between 5% and 10% of 
the true momentum over the whole pj spectrum and detector acceptance. The jet momentum 
resolution achieved is cr{pj)/pj = 85%/ y^px/GeV © 4% for central jets. A selection is applied 
to separate jets originating in the primary interaction from those due to energy deposits as- 
sociated with pileup. The discrimination is based on the differences in the jet shapes, in the 
relative multiplicity of charged and neutral components, and in the fraction of transverse mo- 
mentum carried by the hardest components. Within the tracker acceptance the jet tracks are 
also required to be consistent with originating at the primary vertex. 

The missing transverse energy vector is taken as the negative vector sum of all particle trans- 
verse momenta, and its magnitude is referred to as E^^^^. The typical missing transverse energy 
resolution is around 0.5\/'LEj GeV ||38|, where EEj is the scalar sum of all particle transverse 
momenta in GeV. 

The energy deposited in the ECAL is clustered both with general clustering algorithms f39] and 
with algorithms that constrain the clusters in t] and (p to the shapes expected from electrons and 
photons with high pj ||40|| . These specialised algorithms are used to cluster electromagnetic 
showers without any hypothesis regarding whether the particle originating from the interac- 
tion point was a photon or an electron; doing this for electrons from Z — )■ ee events provides a 
measurement of the photon trigger, reconstruction, and identification efficiencies, as well as of 
the photon energy scale and resolution. The width of the reconstructed Z resonance is used to 
quantify the performance of the ECAL, using decays to two electrons whose energies are mea- 
sured using the ECAL alone, with only their directions being determined from the tracks. In the 
7TeV data set, the dielectron mass resolution at the Z boson mass is 1.56 GeV in the barrel and 
2.57 GeV in the endcaps, while in the 8TeV sample, reconstructed with preliminary calibration 
constants, the corresponding values are 1.61 and 3.75 GeV. For electrons, the reconstruction 
combines the clusters in the ECAL and the trajectory in the silicon tracker [41]. Trajectories 
in the tracker volume are reconstructed using a model of electron energy loss and fitted with 
a Gaussian sum filter ||42| . The electron momentum is determined from the combination of 
ECAL and tracker measurements. 

Muon candidates are reconstructed with two algorithms, one in which the tracks in the silicon 
detector are matched to segments in the muon chambers, and another in which a combined 
fit is performed to the signals found in both the silicon tracker and muon systems [39]. The 
efficiency to reconstruct a muon of px > 5 GeV is larger than 95%, while the probability to 
misidentify a hadron as a muon is below 0.1%. For pj > 200 GeV the precision of the mo- 
mentum measurement improves when the silicon tracker signals are complemented with the 
information from the muon chambers. 

Selection based on isolation of lepton and photon objects is used extensively. The scalar sum of 
the transverse momenta of the particles reconstructed within a distance AR of the object, some- 
times normalised to the pj of the object, is required to not exceed some value. The distance AR 
is defined as AR = y^(A?^P~+~(A^)^, where Arj and A^ are the pseudorapidity and azimuthal 
angle differences between the particle direction and the object direction. Typical values of AR 
used are 0.3 or 0.4. 
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The measurement of the integrated luminosity in CMS is based on a pixel cluster counting 
method, which exploits the large number of silicon pixels, and hence their low occupancy in a 
pp collision [43J. The cross section normalisation is derived from van der Meer scans [44 J. The 
uncertainties on the luminosity measurements are 2.2% and 4.4% for the 7TeV and 8 TeV data 
sets, respectively. 

4 Searches for the standard model Higgs boson 

Initial phenomenological discussions of Higgs boson production and decay can be found in 
Refs. ||45l - l52| . Four main mechanisms are predicted for Higgs boson production in pp colli- 
sions: the gluon-gluon fusion mechanism, which has the largest cross section, followed in turn 
by vector-boson fusion (VBF), associated WH and ZH production (VH), and production in as- 
sociation with top quarks (tfH). The cross sections for the individual production mechanisms 
and the decay branching fractions, together with their uncertainties, have been computed fol- 
lowing Refs. ||53}lZI and are compiled in Refs. 11191 l98ll . 

The particular set of sensitive decay modes of the SM Higgs boson depends strongly on niu. 
The results presented in this paper are based on five most sensitive decay modes in the low- 
mass region: H — > 77; H — > ZZ followed by ZZ decays to Ai; H — > WW followed by decays 
to 2£2v; H — > TT followed by at least one leptonic t decay; and H — > bb followed by b- 
quark fragmentation into jets. This list is presented in Table [l] and comprises the full set of 
decay modes and subchannels, or categories, for which both the 7 and 8 TeV data sets have 
been analysed. Other lower sensitivity subchannels (ttH, H —> bb; W/ZH, H — > tt; W/ZH, 
H -> WW -> 2i2v; H -> ZZ -> 2£2q) have also been studied, so far only in the 7 TeV data, and 
are not included here. Adding these analyses in the combination results in an improvement of 
0.1 (7 in the overall expected local significance at niyi = 125 GeV. 

Table 1: Summary of the subchannels, or categories, used in the analysis of each decay mode. 



Decay 


Production 


No. of 


niii range 


Int. Lum. (fb^^) 


mode 


tagging 


subchannels 


(GeV) 


7 TeV 


8 TeV 


77 


untagged 
dijet (VBF) 


4 

1 or 2 


110-150 


5.1 


5.3 


ZZ 


untagged 


3 


110-600 


5.1 


5.3 


WW 


untagged 
dijet (VBF) 


4 

1 or 2 


110-600 


4.9 


5.1 


TT 


untagged 
dijet (VBF) 


16 
4 


110-145 


4.9 


5.1 


bb 


lepton, E^ss (VH) 


10 


110-135 


5.0 


5.1 



For a given value of nin, the search sensitivity depends on the production cross section, the 
decay branching fraction into the chosen final state, the signal selection efficiency, the mass 
resolution, and the level of background from identical or similar final-state topologies. 

Samples of MC events used to represent signal and background are fully simulated using 
GEANT4 [99]. The simulations include pileup interactions matching the distribution of the 
number of such interactions observed in data. The description of the Higgs boson signal is 
obtained from MC simulation using, for most of the decay modes and production processes, 
the next-to-leading-order (NLO) matrix-element generator POWHEG IllOOillOl l, interfaced with 
PYTHIA 6.4 1,102 J . For the dominant gluon-gluon fusion process, the transverse momentum 
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5 Decay modes with high mass resolution 



spectrum of the Higgs boson in the 7 TeV MC samples is reweighted to the next-to-next-to- 
leading-logarithmic (NNLL) + NLO distribution computed with HqT EZl iS 11031 and Fe- 
HiPro [ 1041 1105L except in the H — > ZZ analysis, where the effect is marginal. The agree- 
ment of the Pt spectrum in the simulation at 8 TeV with the NNLL + NLO distribution makes 
reweighting urmecessary. The improved agreement is due to a modification in the powheg 
setup recommended in Ref. |98|. The simulation of associated-production signal samples uses 
PYTHIA, and all signal samples for H — > bb are made using POWHEG interfaced to HER- 
WIG++ ||106J- Samples used for background studies are generated with pythia, powheg, and 
MadGraph |107( . The uncertainty in the signal cross section related to the choice of parton 
distribution functions is determined with the PDF4LHC prescription ||92] - |96| . 

The overall statistical methodology [108J used in this paper was developed by the CMS and 
ATLAS Collaborations in the context of the LHC Higgs Combination Group. A more concise 
summary of CMS usage in the search for a Higgs boson is given in Ref. |T7|. The modified 
frequentist criterion CLg II109[ I110| is used for the calculation of exclusion limits. Systematic 
uncertainties are incorporated as nuisance parameters and are treated according to the fre- 
quentist paradigm. The combination of searches requires simultaneous analysis of the data 
selected by all individual analyses, accounting for all statistical and systematic uncertainties 
and their correlations. The probability for a background fluctuation to be at least as large as 
the observed maximum excess is termed the local p-value, and that for an excess anywhere in a 
specified mass range the global p-value. This probability can be evaluated by generating sets of 
simulated data incorporating all correlations between analyses optimized for different Higgs 
boson masses. The global p-value (for the specified region) is greater than the local p-value, 
and this fact is often referred to as the look-elsewhere effect (LEE) [lllj. Both the local and 
global p-values can be expressed as a corresponding number of standard deviations using the 
one-sided Gaussian tail convention. The magnitude of a possible Higgs boson signal is char- 
acterised by the production cross section times the relevant branching fractions, relative to the 
SM expectation, denoted a/ osm and referred to as the signal strength. The results presented 
in this paper are obtained using asymptotic formulae |ill2J, including a few updates recently 
introduced in the ROOStats package II113( . 

Figure [l] shows the expected local p-values in the mass range 110-145 GeV for the five decay 
modes reported here. The expected significance of a SM Higgs boson signal at niu = 125 GeV 
when the five decay modes are combined is 5.6 a. The highest sensitivity in this mass range is 
achieved in the ZZ, 77, and WW channels. Because of the excellent mass resolution (1-2 GeV) 
achieved in the 77 and ZZ channels, they play a special role in the low-mass region, where the 
natural width of the SM Higgs boson is predicted to be a few MeV. The expected signature 
in these channels is therefore a narrow resonance above background, with a width consistent 
with the detector resolution. 

5 Decay modes with high mass resolution 

5.1 H ^ 77 

In the H — > 77 analysis a search is made for a narrow peak in the diphoton invariant mass 
distribution in the range 110-150 GeV, on a large irreducible background from QCD production 
of two photons. There is also a reducible background where one or more of the reconstructed 
photons originate from misidentification of jet fragments. Early detailed studies indicated this 
to be one of the most promising channels in the search for a SM Higgs boson in the low-mass 
range l|TT4| . 



5.1 H ^> 77 
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CMS ^ = 7TeV, L = 5.1 fb-' = 8 TeV, L = 5.3 fb'^ 

1a 
2a 

3a 
4a 



5a 



6a 



7a 



110 115 120 125 130 135 140 145 

rriH (GeV) 

Figure 1: Expected local p-values for a SM Higgs boson as a function of nin, for the five decay 
modes and their combination. 

To enhance the sensitivity of the analysis, candidate diphoton events are separated into mutu- 
ally exclusive categories of different expected signal-to-background ratios, based on the prop- 
erties of the reconstructed photons and on the presence of two jets satisfying criteria aimed at 
selecting events in which a Higgs boson is produced through the VBF process. The analysis 
uses multivariate techniques for the selection and classification of the events. As an indepen- 
dent cross-check, an analysis is also performed that is almost identical to the one described in 
Ref. [201, using simpler criteria based on the properties of the reconstructed photons to select 
and classify events. The multivariate analysis achieves 15% higher sensitivity than the cross- 
check analysis. 

The reconstructed primary vertex that most probably corresponds to the interaction vertex of 
the diphoton candidate is identified using the kinematic properties of the tracks associated 
with that vertex and their correlation with the diphoton kinematics. In addition, if either of 
the photons converts and the tracks from the conversion are reconstructed and identified, the 
direction of the converted photon contributes to the identification of the hard-scattering vertex. 
More details can be found in Ref. pO]|. 

The event selection requires two photon candidates satisfying pj requirements and "loose" 
photon identification criteria. These photons must be reconstructed within the fiducial region, 
\rj\ < 2.5, excluding the barrel-endcap transition region, 1.44 < < 1.57. A pj threshold of 
{nijy/A) is applied to the photon leading (sub leading) in pj, where nijj is the diphoton 
invariant mass. Scaling the pj thresholds in this way avoids distortion of the shape of the niyy 
distribution. In the case of events passing the dijet selection, the requirement on the leading 
photon is increased to m^y/l, further reducing background with negligible loss of signal. 

Jet selection criteria are applied to the two jets of largest pj in the event within \rj\ < 4.7. The 
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5 Decay modes with high mass resolution 



jet selection requirements are optimized using simulated VBF signal and diphoton background 
events. The pj thresholds for the two jets are 30 and 20 GeV, and their f] separation is required 
to be greater than 3.5. The dijet invariant mass is required to be greater than 350 and 250 GeV 
for the 7 and 8 TeV data sets, respectively. The lower dijet invariant mass requirement for the 
8 TeV data set reflects the fact that for the analysis of that data set, the dijet event category is 
divided into two to increase the search sensitivity. This division creates a second "tight" dijet- 
tagged category in which the dijet invariant mass must be greater than 500 GeV and both jets 
must have pj > 30 GeV. Two additional selection criteria, relating the dijet to the diphoton 
system, are applied: the difference between the average pseudorapidity of the two jets and 
the pseudorapidity of the diphoton system is required to be less than 2.5, and the difference 
in azimuthal angle between the diphoton system and the dijet system is required to be greater 
than 2.6 radians. 

A multivariate regression is used to extract the photon energy and a photon-by-photon estimate 
of the uncertainty in that measurement. The calibration of the photon energy scale uses the Z 
boson mass as a reference; ECAL showers coming from electrons in Z — ?► ee events are clustered 
and reconstructed in exactly the same way as photon showers. The photon selection efficiency, 
energy resolution, and associated systematic uncertainties are estimated from data, using Z — > 
ee events to derive data/ simulation correction factors. The jet reconstruction efficiency, the 
efficiency to correctly locate the vertex position, and the trigger efficiency, together with the 
corresponding systematic uncertainties, are also evaluated from data. 

For the multivariate analysis, a boosted decision tree (BDT) 11151 1116 1 is trained to give a high 
output value (score) for signal-like events and for events with good diphoton invariant mass 
resolution, based on the following observables: (i) the photon quality determined from elec- 
tromagnetic shower shape and isolation variables; (ii) the expected mass resolution; (iii) the 
per-event estimate of the probability of locating the diphoton vertex within 10 mm of its true 
location along the beam direction; and (iv) kinematic characteristics of the photons and the 
diphoton system. The kinematic variables are constructed so as to contain no information about 
the invariant mass of the diphoton system. The diphoton events not satisfying the dijet selec- 
tion are classified into five categories based on the output of the BDT, with category boundaries 
optimized for sensitivity to a SM Higgs boson. Events in the category with smallest expected 
signal-to-background ratio are rejected, leaving four categories of events. Dijet-tagged events 
with BDT scores smaller than the threshold for the fourth category are also rejected. Simu- 
lation studies indicate that the background in the selected event categories is dominated by 
the irreducible background from QCD production of two photons and that fewer than 30% of 
the diphoton events used in the analysis contain one or more misidentified photons (predomi- 
nantly from 7+jet production). 

Table |2] shows the expected number of signal events in each event category for a SM Higgs 
boson (of mil = 125 GeV), and the background at m-y-y = 125 GeV, estimated from the fit de- 
scribed below. The estimated mass resolution is also shown, measured both by c^eff/ h^lf the 
minimum width containing 68% of the signal events, and by the full width at half maximum 
(FWHM). A large variation in the expected signal-to-background ratio between the categories 
can be seen, although as a consequence of the optimization of the category boundaries the ex- 
pected signal significances in each category are rather similar. The differences in the relative 
signal-to-background ratio between the categories are almost independent of nin. 

The background is estimated from data, without the use of MC simulation, by fitting the dipho- 
ton invariant mass distribution in each of the categories in a range (100 < m-y-y < 180 GeV) 
extending slightly above and below that in which the search is performed. The choices of the 
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Table 2: Expected numbers of SM Higgs boson events (mn = 125 GeV) and estimated back- 
ground (at niyy = 125 GeV) for all event categories of the 7 and 8 TeV data sets. There are 
two dijet-tagged categories for the 8 TeV data as described in the text, and for both data sets 
the remaining untagged events are separated into four categories labelled here BDT 0-3, BDT 
having the largest expected signal-to-background ratio. The composition of the SM Higgs 
boson signal in terms of the production processes, and its mass resolution, are also given. 





Event 


SM Higgs boson expected signal (niyi = 


125 GeV) 


Background 


categories 


Events 


ggH 


VBF 


VH 


ttH 


(GeV) 


FWHM/2.35 
(GeV) 


nijj = 125 GeV 
(events /GeV) 


1 


BDTO 


3.2 


61% 


17% 


19% 


3% 


1.21 
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a 


BDTl 


16.3 


88% 


6% 


6% 




1.26 


1.08 


37.5 ±1.3 




BDT 2 


21.5 


92% 


4% 


4% 




1.59 


1.32 


74.8 ±1.9 


^" 


BDT 3 


32.8 


92% 


4% 


4% 




2.47 


2.07 


193.6 ±3.0 




Dijet tag 


2.9 


27% 


72% 


1% 




1.73 


1.37 


1.7 ±0.2 




BDTO 


6.1 


68% 


12% 


16% 


4% 


1.38 


1.23 


7.4 ± 0.6 


is 


BDTl 


21.0 


87% 


6% 


6% 


1% 


1.53 


1.31 


54.7 ±1.5 


CO 


BDT 2 


30.2 


92% 


4% 


4% 




1.94 


1.55 


115.2 ±2.3 




BDT 3 


40.0 


92% 


4% 


4% 




2.86 


2.35 


256.5 ± 3.4 


% 


Dijet tight 


2.6 


23% 


77% 






2.06 


1.57 


1.3 ±0.2 


00 


Dijet loose 


3.0 


53% 


45% 


2% 




1.95 


1.48 


3.7 ±0.4 



function used to model the background and of the fit range are made based on a study of the 
possible bias on the measured signal strength. Polynomial functions are used. The degree is 
chosen by requiring that the potential bias be at least a factor of 5 smaller than the statistical 
accuracy of the fit prediction. The required polynomial degree ranges from 3 to 5. 

A further independent analysis (referred to as the sideband background model) is performed 
using a different approach to the background modelling. Its sensitivity is very similar to that of 
the standard analysis. It employs a fit to the output of an additional BDT that takes as input the 
diphoton invariant mass and the diphoton BDT output, and uses a background model derived 
from the sidebands of the invariant-mass distribution. A fit to the diphoton invariant-mass 
distribution is used to obtain the background normalisation. This fit is of a power law and 
excludes a window of width ±2% x mn around the mass hypothesis. The methodology allows 
a systematic uncertainty to be assigned to the fit shape. 

The expected 95% CL upper limit on the signal strength a/ a^u, in the background-only hy- 
pothesis, for the combined 7 and 8 TeV data, is less than 1.0 in the range 110 < wjh < 140 GeV, 
with a value of 0.76 at jwh = 125 GeV. The observed limit indicates the presence of a significant 
excess at mn = 125 GeV in both the 7 and 8 TeV data. The features of the observed limit are 
confirmed by the independent sideband-background-model and cross-check analyses. 

The local p-value is shown as a function of niu in Fig. |2] for the 7 and 8 TeV data, and for their 
combination. The expected significance of the local p-value for a SM Higgs boson of mass 
125 GeV is 2.8 cr. The minimum local p-value corresponding to the upward fluctuation of the 
observed limit at 125 GeV corresponds to 4.1 a. The significances of the corresponding mini- 
mum local p-values seen with the sideband-background-model and the cross-check analysis 
are 4.6 c at nin = 125 GeV and 3.7 a at nin = 124 GeV, respectively. The best-fit signal strength 
for a SM Higgs boson mass hypothesis of 125 GeV is a/ ctsm = 1-6 ± 0.4. 

In order to illustrate, in the niyy distribution, the significance given by the statistical methods. 
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5 Decay modes with high mass resolution 



it is necessary to take into account the large differences in the expected signal-to-background 
ratios of the event categories shown in Table |2] The events are weighted according to the cate- 
gory in which they fall. A weight proportional to S/{S + B) is used, as suggested in Ref. fll/l, 
where S and B are the number of signal and background events, respectively, calculated from 
the simultaneous signal-plus-background fit to all categories and integrating over a 2(7^^ wide 
window, in each category, centred on 125 GeV. Figure |3] shows the data, the signal model, and 
the background model, all weighted. The weights are normalised such that the integral of the 
weighted signal model matches the number of signal events given by the best fit. The un- 
weighted distribution, using the same binning but in a more restricted mass range, is shown as 
an inset. The excess at 125 GeV is evident in both the weighted and unweighted distributions. 

CMS ifs = 7 TeV, L = 5. 1 fb^ \fs = 8 TeV, L = 5.3 fb^ 

0) 1 

to 
> 



o 
_] 

10^ 
10^ 



10-^ 



'"110 115 120 125 130 135 140 145 150 

niH (GeV) 

Figure 2: The local p-value as a function of mn in the 77 decay mode for the combined 7 and 
8 TeV data sets. The additional lines show the values for the two data sets taken individually. 
The dashed line shows the expected local p-value for the combined data sets, should a SM 
Higgs boson exist with mass mn- 




5.2 H^ZZ 

In the H — ?► ZZ — )■ A£ decay mode a search is made for a narrow four-lepton mass peak in the 
presence of a small continuum background. Early detailed studies outlined the promise of this 
mode over a wide range of Higgs boson masses [118J. Only the search in the range 110-160 GeV 
is reported here. Since there are differences in the reducible background rates and mass resolu- 
tions between the subchannels 4e, 4f/, and 2e2^, they are analysed separately. The background 
sources include an irreducible four-lepton contribution from direct ZZ production via qq and 
gluon-gluon processes. Reducible contributions arise from Z + bb and tt production where the 
final states contain two isolated leptons and two b-quark jets producing secondary leptons. 
Additional background arises from Z+jets and WZ+jets events where jets are misidentified as 
leptons. Compared to the analysis reported in Ref. Il2ll| , the present analysis employs improved 
muon reconstruction, improved lepton identification and isolation, and a kinematic discrimi- 
nant exploiting the decay kinematics expected for the signal events. An algorithm to recover 
final-state radiation (FSR) photons has also been deployed. 

Electrons are required to have pj > 7 GeV and |f/| < 2.5. The corresponding requirements for 
muons are px > 5 GeV and \ri\ < 2.4. Electrons are selected using a multivariate identifier 
trained using a sample of W+jets events, and the working point is optimized using Z+jets 
events. Both muons and electrons are required to be isolated. The combined reconstruction 
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Figure 3: The diphoton invariant mass distribution with each event weighted by the S/ [S + B) 
value of its category. The Hnes represent the fitted background and signal, and the coloured 
bands represent the ±1 and ±2 standard deviation uncertainties on the background estimate. 
The inset shows the central part of the unweighted invariant mass distribution. 
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5 Decay modes with high mass resolution 



and selection efficiency is measured using electrons and muons in Z boson decays. Muon 
reconstruction and identification efficiency for muons with px < 15 GeV is measured using J/ xp 
decays. 

The electron or muon pairs from Z boson decays are required to originate from the same pri- 
mary vertex. This is ensured by requiring that the significance of the impact parameter with 
respect to the event vertex satisfy |Sip| < 4 for each lepton, where Sjp = 1/ aj, I is the three- 
dimensional lepton impact parameter at the point of closest approach to the vertex, and aj its 
uncertainty. 

Final-state radiation from the leptons is recovered and included in the computation of the 
lepton-pair invariant mass. The FSR recovery is tuned using simulated samples of ZZ — > A£ 
and tested on data samples of Z boson decays to electrons and muons. Photons reconstructed 
within \ tj\ < 2.4 are considered as possibly due to FSR. The photons must satisfy the following 
requirements. They must be within AR < 0.07 of a muon and have Pj>2 GeV (most photon 
showers within this distance of an electron having already been automatically clustered with 
the electron shower); or if their distance from a lepton is in the range 0.07 < AR < 0.5, they 
must satisfy pj > 4 GeV, and be isolated within AR = 0.3. Such photon candidates are com- 
bined with the lepton if the resulting three-body invariant mass is less than 100 GeV and closer 
to the Z boson mass than the mass before the addition of the photon. 

The event selection requires two pairs of same-flavour, oppositely charged leptons. The pair 
with invariant mass closest to the Z boson mass is required to have a mass in the range 40- 
120 GeV and the other pair is required to have a mass in the range 12-120 GeV. The ZZ back- 
ground is evaluated from MC simulation studies. Two different approaches are employed to 
estimate the reducible and instrumental backgrounds from data. Both start by selecting events 
in a background control region, well separated from the signal region, by relaxing the isolation 
and identification criteria for two same-flavour reconstructed leptons. In the first approach, 
the additional pair of leptons is required to have the same charge (to avoid signal contamina- 
tion) while in the second, two opposite-charge leptons failing the isolation and identification 
criteria are required. In addition, a control region with three passing and one failing lepton is 
used to estimate contributions from backgrounds with three prompt leptons and one misiden- 
tified lepton. The event rates measured in the background control region are extrapolated to 
the signal region using the measured probability for a reconstructed lepton to pass the isola- 
tion and identification requirements. This probability is measured in an independent sample. 
Within uncertainties, comparable background counts in the signal region are estimated by both 
methods. 

The number of selected ZZ — > Ai candidate events in the mass range 110 < m^f < 160 GeV, 
in each of the three final states, is given in Table |3| where is the four-lepton invariant 
mass. The number of predicted background events, in each of the three final states, and their 
uncertainties are also given, together with the number of signal events expected from a SM 
Higgs boson of jwh = 125 GeV. The distribution is shown in Fig. |4] There is a clear peak 
at the Z boson mass where the decay Z — ?► 4^" is reconstructed. This feature of the data is well 
reproduced by the background estimation. The figure also shows an excess of events above 
the expected background around 125 GeV. The total background and the numbers of events 
observed in the three bins where an excess is seen are also shown in Table |3l The combined 
signal reconstruction and selection efficiency, with respect to the mn = 125 GeV generated 
signal with niu > 1 GeV as the only cut, is 18% for the 4e channel, 40% for the 4^ channel, and 
27% for the 2e2^ channel. 

The kinematics of the H — ?► ZZ — ?► Ai process, for a given invariant mass of the four-lepton 
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Figure 4: Distribution of the four-lepton invariant mass for the ZZ — )■ 4£ analysis. The 
points represent the data, the filled histograms represent the background, and the open his- 
togram shows the signal expectation for a Higgs boson of mass wjh = 125 GeV, added to the 
background expectation. The inset shows the ma distribution after selection of events with 
Kd > 0.5, as described in the text. 



Table 3: The number of selected events, compared to the expected background yields and ex- 
pected number of signal events (wh = 125 GeV) for each final state in the H — )■ ZZ analysis. The 
estimates of the Z + X background are based on data. These results are given for the mass range 
from 110 to 160 GeV. The total background and the observed numbers of events are also shown 
for the three bins ("signal region") of Fig.lllwhere an excess is seen (121.5 < nii^i < 130.5 GeV). 



Channel 


4e 




2e2^ 




ZZ background 


2.7 ± 0.3 


5.7 ±0.6 


7.2 ± 0.8 


15.6 ± 1.4 


Z + X 






7 a+1.8 


4 4+2.2 


All backgrounds (110 < m^/, < 160 GeV) 


4.0 ±1.0 


6.6 ± 0.9 


9.7 ±1.8 


20 ±3 


Observed (110 < m^f < 160 GeV) 


6 


6 


9 


21 


Signal (ffiH = 125 GeV) 


1.36 ± 0.22 


2.74 ± 0.32 


3.44 ± 0.44 


7.54 ± 0.78 


All backgrounds (signal region) 


0.7 ±0.2 


1.3 ±0.1 


1.9 ±0.3 


3.8 ±0.5 


Observed (signal region) 


1 


3 


5 


9 
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6 Decay modes with low mass resolution 



system, is fully described by five angles and the invariant masses of the two lepton pairs ||119l - 
I12H . These seven variables provide significant discriminating power between signal and back- 
ground. A kinematic discriminant is constructed based on the probability ratio of the signal 
and background hypotheses, = Vsig/ {'Psig + ^bkg)/ described in Ref. |I122| . The likeli- 
hood ratio is defined for each value of m4£. For the signal, the phase-space and Z propagator 
terms II123II are included in a fully analytic parameterization [120], while the background prob- 
ability is tabulated using a simulation of the qq — > ZZ/Z7 process. The statistical analysis only 
includes events with m4(? > 100 GeV. 

Figure |5] (upper) shows the distribution of 1714^^ versus Ko for events selected in the Ai sub- 
channels. The colour-coded regions show the expected background. Figure |5] (lower) shows 
the same two-dimensional distribution of events, but this time superimposed on the expected 
event density from a SM Higgs boson (wjh = 125 GeV). A clustering of events is observed 
around 125 GeV with a high score in the kinematic discriminant, where the background ex- 
pectation is low and signal expectation high, corresponding to the excess seen in the one- 
dimensional mass distribution. The m^^f distribution of events satisfying > 0-5 is shown 
in the inset in Fig. |4] 

There are three final states and two data sets (7 and 8 TeV), and thus the statistical treatment 
requires six simultaneous two-dimensional maximum-likelihood fits for each value of jwh, in 
the variables and Kq. Systematic uncertainties are evaluated from data for the trigger ef- 
ficiency and for the combined lepton reconstruction, identification, and isolation efficiencies. 
Systematic uncertainties on the energy/ momentum calibration and on the energy resolution 
are estimated from data. Additional systematic uncertainties arise from limited statistical pre- 
cision in the reducible background control regions. 

The expected 95% CL upper limit on the signal strength a/ asu, in the background-only hy- 
pothesis, for the combined 7 and 8 TeV data, falls steeply between 110 and 140 GeV, and has a 
value of 0.6 at wjh = 125 GeV. The observed upper limit indicates the presence of a significant 
excess in the range 120 < nin < 130 GeV. The local p-value is shown as a function of nin in 
Fig. |6] for the 7 and 8 TeV data, and for their combination. The minimum local p-value in the 
data occurs at Wh = 125.6 GeV and has a significance of 3.2 a (expected 3.8 a). The combined 
best-fit signal strength for a SM Higgs boson mass hypothesis of 125.6 GeV is a/ osm = O-^^q 3- 

6 Decay modes with low mass resolution 

6.1 H WW 

The decay mode H — ?► WW was first proposed as a promising search channel for study at the 
LHC in Ref. I|124| . It is highly sensitive to a SM Higgs boson in the mass range around the 
WW threshold of 160 GeV. With the development of tools for lepton identification and £^'*^ re- 
construction optimized for LHC pileup conditions, it is possible to extend the sensitivity down 
to 120 GeV. This decay mode is analysed by selecting events in which both W bosons decay 
leptonically resulting in a signature with two isolated, oppositely charged leptons (electrons 
or muons) and large E^'** due to the undetected neutrinos. A pj threshold of 20 (10) GeV is 
applied to the lepton leading (subleading) in pj. The analysis of the 7 TeV data is described 
in Ref. ||22| and remains unchanged, while the 8 TeV analysis was modified to cope with more 
difficult conditions induced by the higher pileup of the 2012 data taking. 

Events are classified according to the number of jets (0, 1, or 2) with pj > 30 GeV and within 
Ir/I < 4.7 (I //I < 5.0 for the 7 TeV data set), and further separated into same-flavour (ee and /Yf/) 
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Figure 5: The distribution of ever\ts selected in the 4i subchannels for the kinematic discrim- 
inant, Ko, versus m^f . Events in the three final states are marked by filled symbols (defined 
in the legend). The horizontal error bars indicate the estimated mass resolution. In the upper 
plot the colour-coded regions show the background expectation; in the lower plot the colour- 
coded regions show the event density expected from a SM Higgs boson (wjh = 125 GeV) (both 
in arbitrary units). 
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6 Decay modes with low mass resolution 
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Figure 6: The observed local p-value for the ZZ decay mode as a function of the SM Higgs 
boson mass. The dashed line shows the expected local p-values for a SM Higgs boson with a 
mass wzh- 



or different-flavour (e^) categories. Events with more than two jets are rejected. To improve the 
sensitivity of the analysis, the selection criteria are optimized separately for the different event 
categories since they are characterised by different dominating backgrounds. The zero-jet eji 
category has the best signal sensitivity. Its main backgrounds are irreducible nonresonant WW 
production and reducible W+jets processes, where a jet is misidentified as a lepton. The one-jet 
e}i and zero-jet same-flavour categories only contribute to the signal sensitivity at the 10% level 
because of larger backgrounds, from top-quark decays and Drell-Yan production, respectively. 
Event selection in the two-jet category is optimized for the VBF production mechanism. This 
category has the highest expected signal-to-background ratio, but its contribution to the overall 
sensitivity is small owing to the lower cross section relative to inclusive production. 

The projected E^'** variable Il22| is used to reduce the Drell-Yan background arising from 
events where the £^'^® vector is aligned with the lepton pj, as well as events with mismeasured 
£^^*® associated with poorly reconstructed leptons and jets. The projected £™^^ is defined as 
the transverse component of the E^^^^ vector with respect to the closest lepton direction, if it is 
closer than n/2 in azimuthal angle, or the full E^^^^ otherwise. Since the projected £^^*^ resolu- 
tion is deteriorated by pileup, the minimum of two different projected £^'^® definitions is used: 
the first includes all particle candidates in the event, while the second uses only the charged 
particle candidates associated with the primary vertex. In the 8 TeV analysis, the minimum 
projected £™'®® defined in this way is then required to be above a threshold that varies by cate- 
gory. For niii > 140 GeV, projected £^^^^ is required to be greater than 20 GeV in the efi channel, 
and greater than 45 GeV in the same-flavour channels. For jwh < 140 GeV in the same-flavour 
channels, where it is more difficult to separate the signal from the Drell-Yan background, a 
multivariate selection is used, combining kinematic and topological variables. In the two-jet 
category, a simple selection of £^'** > 45 GeV is applied. To further reduce the Drell-Yan back- 
ground in the same-flavour final states, events with a dilepton mass within 15 GeV of the Z 
boson mass are rejected. The background from low-mass resonances is rejected by requiring a 
dilepton invariant mass greater than 12 GeV. 

To suppress the top-quark background, a "top tagging" technique based on soft-muon and b- 
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jet tagging is applied. The first method is designed to veto events containing muons from b jets 
coming from decays of top quarks. The second method uses a b-jet tagging algorithm, which 
looks within jets for tracks with large impact parameters. The algorithm is applied also in the 
case of zero-jet events, which may contain low-px jets below the selection threshold. To reduce 
the background from WZ production, events with a third lepton passing the identification and 
isolation requirements are rejected. 

Yields for the dominant backgrounds are estimated using control regions in the data. The 
W+jets contribution is derived from data using a "tight-loose" sample in which one lepton 
passes the standard criteria and the other does not, but instead satisfies a "loose" set of require- 
ments. The efficiency eioose for ^ jet that satisfies the loose selection to pass the tight selection 
is determined using data from an independent loose lepton-trigger sample dominated by jets. 
The background contamination is then estimated using the events of the "tight-loose" sample 
weighted by eioose/ (1 ~ ^loose)- The normalisation of the top-quark background is estimated 
by counting the number of top-tagged events and applying the corresponding top-tagging effi- 
ciency. The nonresonant WW contribution is normalised by using events with a dilepton mass 
larger than 100 GeV, where the Higgs boson signal contamination is negligible, extrapolated 
to the signal region using simulated samples. The same-flavour Drell-Yan background is nor- 
malised using the number of events observed with a dilepton mass within 7.5 GeV of the Z 
boson mass, after subtracting the non-Drell-Yan contribution. Other minor backgrounds from 
WZ, ZZ, and W7 are estimated from simulation. 

The 7 TeV data are analysed by training a BDT for each Higgs boson mass hypothesis in the 
zero-jet and one-jet event categories, while a simple selection strategy is employed in the VBF 
category [22]. In the BDT analysis, the Higgs boson signal is separated from the background by 
using a binned maximum-likelihood fit to the classifier distribution. The 8 TeV analysis is based 
on a simple selection strategy optimized for each mass hypothesis, where additional kinematic 
and topological requirements are applied to improve the signal-to-background ratio. One of 
the most sensitive variables to discriminate between H — > WW decays and nonresonant WW 
production is the dilepton invariant mass m^^, which has a resolution of about 20%. This quan- 
tity is shown in Fig. for the zero-jet e/Y category after the full selection for mn = 125 GeV, 
except for the selection on nifj itself. Table |4] shows for the 8 TeV analysis the number of events 
selected in data, background estimates, and signal predictions for jwh = 125 GeV in each anal- 
ysis category after applying all the selection requirements. About 97% of the signal events 
selected in the zero-jet efi category are expected to be produced by the gluon-gluon fusion pro- 
cess, whereas 83% of the signal in the two-jet category is expected to be produced by the 
VBF process. The 95% CL expected and observed limits for the combination of the 7 and 8 TeV 
analyses are shown in Fig. |8] A broad excess is observed across the mass range. Given the 
modest mass resolution in this channel, this excess is consistent with a SM Higgs boson with 
a mass around 125 GeV. This is illustrated by the dotted curve in Fig. |8] showing the median 
expected limit under the hypothesis of the presence of a SM Higgs boson with mass 125 GeV. 
The expected significance for a SM Higgs of mass 125 GeV is 2.4 a and the observed significance 
is 1.6 (7. 

6.2 H ^ TT 

The decay mode H — ?► tt is searched for in four exclusive subchannels, corresponding to dif- 
ferent decays of the r pair: eft, er^, and }iT\y, where electrons and muons arise from leptonic 
T decays, and Th denotes hadronic t decays. The latter are reconstructed by selecting t decays 
consistent with the hypothesis of three charged pions, or one charged pion and up to two neu- 
tral pions ni25[ . The search is made in the mass range 110-145 GeV, and a signal should appear 
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Table 4: Observed number of events, background estimates and signal predictions for = 
125 GeV in each category of the WW analysis of the 8 TeV data set. All the selection require- 
ments have been applied. The combined experimental and theoretical, systematic and statis- 
tical uncertainties are shown. The Z7 process includes the dimuon, dielectron, and rr — ?■ II 
final states. 



Category: 


0-jet e}i 


0-jet U 


1-jet e}L 


1-jet U 


2-jet e}i 


2-jet U 


WW 


87.6 ± 9.5 


60.4 ± 6.7 


19.5 ±3.7 


9.7 ±1.9 


0.4 ±0.1 


0.3 ±0.1 


WZ + ZZ + Z7 


2.2 ±0.2 


37.7 ± 12.5 


2.4 ±0.3 


8.7 ±4.9 


0.1 ±0.0 


3.1 ±1.8 


Top 


9.3 ±2.7 


1.9 ± 0.5 


22.3 ± 2.0 


9.5 ±1.1 


3.4 ±1.9 


2.0 ±1.2 


W+jets 


19.1 ±7.2 


10.8 ± 4.3 


11.7 ±4.6 


3.9 ±1.7 


0.3 ±0.3 


0.0 ±0.0 


W7(*) 


6.0 ±2.3 


4.6 ± 2.5 


5.9 ±3.2 


1.3 ±1.2 


0.0 ±0.0 


0.0 ±0.0 


All backgrounds 


124.2 ± 12.4 


115.5 ± 15.0 


61.7 ±7.0 


33.1 ±5.7 


4.1 ±1.9 


5.4 ±2.2 


Signal (ffiH = 125 GeV) 
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4.4 ±1.3 
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0.8 ±0.1 
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Figure 7: Distribution of nifx for the zero-jet e}i category in the H — ?► WW search at 8 TeV. 
The signal expected from a Higgs boson with a mass OTh = 125 GeV is shown added to the 
background. 
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Figure 8: The 95% CL limit on a/ Csm for a Higgs boson decaying, via a W boson pair, to 
two leptons and two neutrinos, for the combined 7 and 8 TeV data sets. The symbol cr/ a^M 
denotes the production cross section times the relevant branching fractions, relative to the SM 
expectation. The background-only expectations are represented by their median (dashed line) 
and by the 68% and 95% CL bands. The dotted curve shows the median expected limit for a 
SM Higgs boson with Mh = 125 GeV. 
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6 Decay modes with low mass resolution 



as a broad excess in the distribution of the T-pair invariant mass tUtt- 

The sensitivity of the search is improved by classifying the events according to jet multipHcity 
and the transverse momentum of the reconstructed t. The multiplicity of jets with pj > 30 GeV 
reflects the production mechanism: events with zero or one jet are likely to come from the 
gluon-gluon fusion process, while events with two jets are candidates for VBF production. 
Events including b jets with px > 20 GeV are removed from zero- and one-jet categories. The 
signal purities in the zero- and one-jet categories are increased, and the rriTT resolution is im- 
proved, by separating events into low- and high-px subchannels. The high-px subchannels are 
defined by > 40 GeV in channels with a hadronic T decay, and pj > 35 (30) GeV in the 
e}i {fiji) channel. The mass m^T is reconstructed with an algorithm II126I combining the visible 
T decay products and the missing transverse energy, achieving a resolution of about 20% on 
mT-T-. Figure |9] shows as an example the reconstructed Wt-t distribution in the f/Th VBF category 
for the combined 7 and 8 TeV data samples. 
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Figure 9: Distribution of wJtt in the combined 7 and 8 TeV data sets for the f/Th VBF category of 
the H — > TT search. The signal expected from a SM Higgs boson (jwh = 125 GeV) is added to 
the background simulation. 



Backgrounds in the ej^ and fiji channels arise from tf and Drell-Yan production, while W and 
Z production with a misidentified candidate from an electron, muon, or jet dominates in 
the hadronic channels. Backgrounds from Z — > tt decays are modelled with Z — > j-iji events 
in data where each muon is replaced with particles from simulated decays of a t with the 
same momentum as the muon. Reducible backgrounds, comprising W+jets, QCD multijet 
production, and residual Z — > ee events, are estimated from the data. An improved signal-to- 
background ratio is achieved by including explicitly in the event selection for the VBF produc- 
tion mechanism the pseudorapidity separation between forward jets and the large invariant 
mass of the dijet system. Table |5] shows the numbers of expected and observed events in the 
most sensitive event categories (VBF) for the 7 and 8 TeV data sets. The expected signal yields 
for a SM Higgs boson with OTh = 125 GeV are also shown. 
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Table 5: Numbers of expected and observed events in the most sensitive event categories (VBF) 
in the H — > tt analysis for the 7 and 8 TeV data sets. The expected signal yields for a SM Higgs 
boson with jwh = 125 GeV are also shown. Combined statistical and systematic uncertainties 
on each estimate are reported. 



Subchannel 






e}i 


m 


Z->TT 


53 ± 5 


100 ± 9 


56 ± 12 


5.3 ± 0.4 


QCD 


35 ± 7 


41 ± 9 


7.4 ±1.4 




W+jets 


46 ± 10 


72 ± 15 






Z+jets 


13 ± 2 


2.5 ± 0.6 














70 ± 8 


tt 


70 ± 1.7 


14 ± 3 


24 ± 2 


6.7 ± 1.5 


Dibosons 


1.2 ± 0.9 


2.9 ± 2.1 


11 ± 2 


2.4 ± 0.9 


All backgrounds 


156 ± 13 


233 ± 20 


99 ± 13 


85 ± 9 


Signal (7«H = 125 GeV) 


4.3 ± 0.6 


7.7 ± 1.1 


3.5 ± 0.4 


0.8 ± 0.1 


Data 


142 


263 


110 


83 



CMS H ^ XT \^ = 7 TeV, L = 4.9 fb"^ \^ = 8 TeV, L = 5.1 fb"^ 
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Figure 10: The 95% CL limit on the signal strength a/ (Tsm for a Higgs boson decaying to r 
pairs, for the combined 7 and 8 TeV data sets. The symbol a/ ctsm denotes the production cross 
section times the relevant branching fractions, relative to the SM expectation. The background- 
only expectations are represented by their median (dashed line) and by the 68% and 95% CL 
bands. 
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6 Decay modes with low mass resolution 



To search for the presence of a Higgs boson signal in the selected events, a binned maximum- 
likelihood fit to niTr is performed jointly across the four final states, each with five event cate- 
gories. Systematic uncertainties are represented by nuisance parameters in the fitting process. 
The expected and observed 95% CL limits on the signal strength for the combination of all cat- 



egories are shown in Fig. 10 The expected and observed limits are 1.3 and 1.1 times the SM 
Higgs boson cross section at mass 125 GeV, respectively. The expected significance for a SM 
Higgs boson of mass 125 GeV is 1.4 a, and the observed value is zero a. 

6.3 H ^ bb 

For mn < 135 GeV, the decay H — > bb has the largest branching fraction of the five search 
modes, but the signal is overwhelmed by QCD production of bottom quarks. The analysis is 
therefore designed to search for the associated production of the Higgs boson in events where 
a dijet resonance is produced at high pj in association with a W or Z boson; this largely sup- 
presses the QCD background. Five independent search channels are explored corresponding 
to different decays of the vector boson: Z{£i)li, Z(t/t/)H, and W(£i/)H. Events are further sep- 
arated into two categories based on the pj of the vector boson, ranging from 50-100 GeV for 
the lowest bin in the Z{ii) search, to greater than 170 GeV for the highest bin in the W{£v) 
search. For the Z{vv) search, two subchannels are defined as 120 < £^'** < 160 GeV and 
£miss y QgV. The two jets comprising the candidate Higgs boson decay are required to be 
identified as b jets, and the dijet system must satisfy a pj threshold that is optimized within 
each channel. 

Dominant backgrounds arise from production of vector bosons in association with jets, pair- or 
single-production of top quarks, and diboson production (WW, WZ, ZZ) with one of the bosons 
decaying hadronically. Significant background rejection is achieved in general by requiring 
large pj for the dijet, while also requiring that there be minimal additional jet activity and that 
the vector boson and dijet be back-to-back in azimuth. The effect on the signal efficiency of 
this selection due to higher-order electroweak II127I and QCD IISTll corrections are accounted 
for in the systematic uncertainties. Further signal discrimination is obtained from the dijet 
invariant mass, which is expected to peak near nin. A multivariate regression algorithm to 
better estimate b-jet pj is trained on jets in simulated signal events and achieves a final dijet 
mass resolution of 8-9% for mn = 125 GeV. The performance of the regression algorithm is 
checked in data using W/Z+jets and tt events. 

A search for the signal is made in the distribution of scores of a BDT trained at discrete mass 
points. Input variables to the BDT algorithm exploit kinematic and topological information 
about the vector boson and dijet systems, and the colour-singlet nature of the Higgs boson 1 128 1 . 
The distribution of scores in simulated background events is checked using control regions in 



the data designed to enrich individual background contributions. Figure 11 shows as an exam- 
ple the BDT scores for the high-pj subchannel of the Z{vv) search in the 8 TeV data set, after all 
selection criteria have been applied. 

The rates for the dominant backgrounds arising from production of W/Z+jets and top-quark 
pairs are estimated in data, while contributions from single-top and diboson production are 
estimated from simulation studies. The signal is then searched for as an excess in the BDT score 
distribution using the predicted shapes for signal and background events, for Higgs boson 
masses in the range 110-135 GeV. 

Combined results for expected and observed 95% CL limits obtained from the 7 and 8 TeV data 



sets are displayed in Fig. 12 The expected and observed limits are 1.6 and 2.1 times the SM 



Higgs boson cross section at mass 125 GeV. The expected local p-value for a SM Higgs of mass 
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CMS (s = 8 TeV, L = 5.0 fb'^ 
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Figure 11: Distribution of BDT scores for the high-px subchannel of the Z(t/t/)H(bb) search 
in the 8 TeV data set after all selection criteria have been applied. The signal expected from 
a Higgs boson (wh = 125 GeV), including W(£i/)H events where the charged lepton is not 
reconstructed, is shown added to the background and also overlaid for comparison with the 
diboson background. 
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7 Combined results 



125 GeV corresponds to 1.9 a, while the observed one corresponds to 0.7 a. 



CMS H ^ bb 6 = 7 TeV, L = 5.0 fb' ^ = 8 TeV, L = 5.1 fb"' 




110 115 120 125 130 135 
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Figure 12: The 95% CL limit on the signal strength a/ (7gM for a Higgs boson decaying to two b 
quarks, for the combined 7 and 8 TeV data sets. The symbol a/ ac,M denotes the production cross 
section times the relevant branching fractions, relative to the SM expectation. The background- 
only expectations are represented by their median (dashed line) and by the 68% and 95% CL 
bands. 



7 Combined results 

The individual results for the channels analysed for the five decay modes, summarised in Ta- 
ble [ij are combined using the methods outlined in Section |4] The combination takes into ac- 
count the experimental statistical and systematic uncertainties as well as the theoretical uncer- 
tainties, which are dominated by the imperfect knowledge of the QCD scale and parton distri- 



bution functions. The CLg is shown in Fig. 13 as a function of the Higgs boson mass hypothesis. 
The observed values are shown by the solid points. The dashed line indicates the median of the 
expected results for the background-only hypothesis, with the green (dark) and yellow (light) 
bands indicating the ranges in which the CLg values are expected to lie in 68% and 95% of the 
experiments under the background-only hypothesis. The probabilities for an observation, in 
the absence of a signal, to lie above or below the 68% (95%) band are 16% (2.5%) each. The 
thick horizontal lines indicate CLg values of 0.05, 0.01, and 0.001. The mass regions where the 
observed CLg values are below these lines are excluded with the corresponding (1 — CLg) con- 
fidence levels. Our previously published results exclude the SM Higgs boson from 127 up to 
600 GeV [17]. In the search described here, the SM Higgs boson is excluded at 95% CL in the 
range 110 < mn < 121.5 GeV. In the range 121.5 < jwh < 128 GeV a significant excess is seen 
and the SM Higgs boson cannot be excluded at 95% CL. 



7.1 Significance of tine observed excess 
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CMS ^ = 7 TeV, L = 5.1 fb"' \(s = 8 TeV, L = 5.3 fb"' 




rriH (GeV) 

Figure 13: The CLg values for the SM Higgs boson hypothesis as a function of the Higgs boson 
mass in the range 110-145 GeV. The background-only expectations are represented by their 
median (dashed line) and by the 68% and 95% CL bands. 



7.1 Significance of the observed excess 

The consistency of the observed excess with the background-only hypothesis may be judged 



from Fig. 14 which shows a scan of the local p-value for the 7 and 8 TeV data sets and their 
combination. The 7 and 8 TeV data sets exhibit an excess of 3.2 a and 3.8 a significance, re- 
spectively, for a Higgs boson mass of approximately 125 GeV. In the overall combination the 
significance is 5.0 a for wjh = 125.5 GeV. Figure [15] gives the local p-value for the five decay 
modes individually and displays the expected overall p-value. 

The largest contributors to the overall excess in the combination are the 77 and ZZ decay 
modes. They both have very good mass resolution, allowing good localization of the invariant 
mass of a putative resonance responsible for the excess. Their combined significance reaches 



5.0 a (Fig. 16 1. The WW decay mode has an exclusion sensitivity comparable to the 77 and ZZ 
decay modes but does not have a good mass resolution. It has an excess with local significance 
1.6(7 for »jh ~ 125 GeV. When added to the 77 and ZZ decay modes, the combined signifi- 
cance becomes 5.1 a. Adding the bb and tt channels in the combination, the final significance 
becomes 5.0 c. Table |6] summarises the expected and observed local p-values for a SM Higgs 
boson mass hypothesis of 125.5 GeV for the various combinations of channels. 

Table 6: The expected and observed local p-values, expressed as the corresponding number of 
standard deviations of the observed excess from the background-only hypothesis, for mn = 
125.5 GeV, for various combinations of decay modes. 



Decay mode/ combination 


Expected {a) 


Observed {a) 


77 


2.8 


4.1 


ZZ 


3.6 


3.1 


TT + bb 


2.4 


0.4 


77 + ZZ 


4.7 


5.0 


77 + ZZ + WW 


5.2 


5.1 


77 + ZZ + WW + TT + bb 


5.8 


5.0 



The global p-value for the search range 115-130 (110-145) GeV is calculated using the method 
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Figure 14: The observed local p-value for 7 TeV and 8 TeV data, and their combination as a 
function of the SM Higgs boson mass. The dashed line shows the expected local p-values for a 
SM Higgs boson with a mass oth- 




(GeV) 



Figure 15: The observed local p-value for the five decay modes and the overall combination as 
a function of the SM Higgs boson mass. The dashed line shows the expected local p-values for 
a SM Higgs boson with a mass wh- 



7.2 Mass of the observed boson 
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Figure 16: The observed local p-value for decay modes with high mass-resolution channels, 77 
and ZZ, as a function of the SM Higgs boson mass. The dashed line shows the expected local 
p-values for a SM Higgs boson with a mass mn- 



suggested in Ref. and corresponds to 4.6 (A.Bcr). These results confirm the very low 

probability for an excess as large or larger than that observed to arise from a statistical fluctu- 
ation of the background. The excess constitutes the observation of a new particle with a mass 
near 125 GeV, manifesting itself in decays to two photons or ZZ. These two decay modes in- 
dicate that the new particle is a boson; the two-photon decay implies that its spin is different 
from one IIT29llT30ll . 

7.2 Mass of the observed boson 

The mass mx of the observed boson is determined using the 77 and ZZ decay modes, with 
the former dominating the precision of the measurement. The calibration of the energy scale 
in the 77 decay mode is achieved with reference to the known Z boson mass, as described 
in Section |5.1[ There are two main sources of systematic uncertainty: (i) imperfect simulation 
of the differences between electrons and photons and (ii) the need to extrapolate from niz to 
Wh ~ 125 GeV. The systematic uncertainties are evaluated by making comparisons between 
data and simulated samples of Z — > ee and H — > 77 (wh = 90 GeV). The two uncertainties, 
which together amount to 0.5%, are assumed to be fully correlated between all the 77 event 
categories in the 7 and 8 TeV data. For the ZZ — > Ai decay mode the energy scale (for electrons) 
and momentum scale (for muons) are calibrated using the leptonic decays of the Z boson, with 
an assigned uncertainty of 0.4%. 

Figure [l7| shows the two-dimensional 68% CL regions for the signal strength a/ asm versus nix 
for the three channels (untagged 77, dijet-tagged 77, and ZZ — ?> Ai). The combined 68% CL 



contour shown in Fig. 17 assumes that the relative event yields among the three channels are 



those expected from the standard model, while the overall signal strength is a free parameter. 



To extract the value of nix in a model-independent way, the signal yields of the three channels 
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8 Conclusions 



are allowed to vary independently. Thus the expected event yields in these channels are scaled 
by independent factors, while the signal is assumed to be due to a particle with a unique mass 
mx- The combined best-fit mass is nix = 125.3 ± 0.4 (stat.) ± 0.5 (syst.) GeV. 

7.3 Compatibility with the SM Higgs boson hypothesis 

A first test of the compatibility of the observed boson with the SM Higgs boson is provided 
by examination of the best-fit value for the common signal strength cr/ Csm/ obtained in a com- 



bination of all search channels. Figure 18 shows a scan of the overall a/ ctsm obtained in the 
combination of all channels versus a hypothesised Higgs boson mass jwh- The band corre- 
sponds to the ±1 c uncertainty (statistical and systematic). The excesses seen in the 7TeV and 
8 TeV data, and in their combination, around 125 GeV are consistent with unity within the ±1 cr 
uncertainties. The observed a/ crc,M value for an excess at 125.5 GeV in a combination of all 
data is 0.87 ± 0.23. The different decay channels and data sets have been examined for self- 



consistency. Figure 19 shows the measured values of a/ (Tsm results obtained for the different 
decay modes. These results are consistent, within uncertainties, with the expectations for a SM 
Higgs boson. 
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Figure 17: The 68% CL contours for the signal strength a/ Csm versus the boson mass mx for the 
untagged 77, 77 with VBF-like dijet, Ai, and their combination. The symbol cr/ ctsm denotes the 
production cross section times the relevant branching fractions, relative to the SM expectation. 
In this combination, the relative signal strengths for the three decay modes are constrained by 
the expectations for the SM Higgs boson. 



8 Conclusions 

Results are presented from searches for the standard model Higgs boson in proton-proton col- 
lisions at a/s = 7 and 8 TeV in the CMS experiment at the LHC, using data samples corre- 
sponding to integrated luminosities of up to 5.1 fb^^ at 7 TeV and 5.3 fb^^ at 8 TeV. The search 
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Figure 18: The observed best-fit signal strength a/ ac,M as a function of the SM Higgs boson 
mass in the range 110-145 GeV for the combined 7 and 8 TeV data sets. The symbol cr/asM 
denotes the production cross section times the relevant branching fractions, relative to the SM 
expectation. The band corresponds to the ±1 standard deviation uncertainty in a/ usm- 
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CMS ^ = 7 TeV. L = 5. 1 fb'' {s^8 TeV, L = 5.3 fb'^ 

125.5 GeV 




Figure 19: Values of a/ asu for the combination (solid vertical line) and for individual decay 
modes (points). The vertical band shows the overall a/ crc,M value 0.87 ± 0.23. The symbol 
a/ denotes the production cross section times the relevant branching fractions, relative to 
the SM expectation. The horizontal bars indicate the ±1 standard deviation uncertainties on the 
cr/ c^sM values for individual modes; they include both statistical and systematic uncertainties. 
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is performed in five decay modes: 77, ZZ, W^W^, t+t^, and bb. An excess of events is 
observed above the expected background, with a local significance of 5.0 a, at a mass near 
125 GeV, signalling the production of a new particle. The expected local significance for a 
standard model Higgs boson of that mass is 5.8 cr. The global p-value in the search range of 
115-130 (110-145) GeV corresponds to 4.6 (4.5 c^). The excess is most significant in the two 
decay modes with the best mass resolution, 77 and ZZ, and a fit to these signals gives a mass 
of 125.3 ± 0.4 (stat.) ± 0.5 (syst.) GeV. The decay to two photons indicates that the new parti- 
cle is a boson with spin different from one. The results presented here are consistent, within 
uncertainties, with expectations for a standard model Higgs boson. The collection of further 
data will enable a more rigorous test of this conclusion and an investigation of whether the 
properties of the new particle imply physics beyond the standard model. 
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